DEVICE AND METHOD FOR ENTERING 
A CHARACTER STRING 

BACKGROUND OF THE INVENTION 
5 1. Field of the Invention 

The present invention relates to a device and a method for entering a 
character string. In particular, the present invention relates to a device and 
a method for entering a character string, in which there are a plurality of 
character strings that are output candidates with respect to the input. 

10 

2. Description of the Prior Art 

The usual method for entering a character string into a computer 
system is to input characters with a keyboard. However, in the case of 
languages with an abundance of characters such as Japanese, it is physically 

15 impossible to implement a number of keys corresponding to all different 
characters. Instead, a "kana" character string or a Latin character string 
(written in "romaji") is typed into the keyboard, and the character string is 
entered by kana— kanji conversion. 

In order to reduce the number of characters that have to be typed, 

20 methods have been proposed, in which characters can be entered using an 
input prediction function, allowing them to be entered without typing in the 
entire character string. In addition, methods have been proposed for 
entering characters without using a keyboard. With such methods, 
characters can be input by using techniques for character recognition of 

25 handwritten characters or voice recognition, for example. Moreover, 
methods have been proposed for entering characters using only the numeric 
keys of portable terminals, hke portable telephones. 

With these methods for entering characters, it is not possible to 
determine a character string unequivocally from the input, so that, by 

30 reference to a suitable dictionary, the appropriate character string is 
determined from a plurality of character strings that are output candidates. 

For example, Pubhcation of Unexamined Japanese Patent 
Application (Tokkai) No. Sho 61—32186 discloses a method using dictionaries 
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for individual pattern recognition and dictionaries for context processing 
that are customized for each user. By using a specialized dictionary for 
each user, the precision of the character string recognition can be enhanced. 

Tokkai Hei 9—179859 discloses a method wherein several conversion 
modes are prepared beforehand, and the candidate character string is 
determined by selecting a conversion mode that is suitable with respect to 
the situation when the character string processing device is used. 

Moreover, Tokkai Hei 11—3331 discloses a method wherein the field of 
the text is specified on the basis of the character strings that are being 
entered in the text, and candidate character strings are determined using a 
dictionary corresponding to this field. 

Thus, the candidate character string corresponding to an input is 
determined by reference to a dictionary, and can vary slightly depending on 
the content of the dictionary. For example, if only the input of a specified 
field is allowed, it is possible to produce a dictionary whose content matches 
the input of the user. 

However, if the input is unspecified, and the content of the dictionary 
does not match the user's input, there is the problem that the selection 
efficiency for candidate character strings does not increase, and the hit ratio 
for input character strings does not increase. For example, even when a 
dictionary has been prepared beforehand, it cannot accommodate 
abbreviations that the user has newly created, new product names or other 
proper nouns, etc. 

Also, dictionaries that can be filled so as to sufficiently accommodate 
unspecified input are conceivable, but that will increase the size of the 
dictionary, and consume valuable computer resources, so that there is the 
problem that handwritten character recognition of online pen input in small 
terminals, such as PDAs (personal digital assistants), cannot be 
accommodated physically. 

For example, the method disclosed in Tokkai Sho 61—32186 attempts 
to raise context sensitivity by providing customized dictionaries serving as 
context— sensitive dictionaries, for each user, but it has the drawback that it 
cannot adapt flexibly to changes in a usage sir nation. 



The method disclosed in Tokkai Hei 9-179859 attempts to raise the 
hit ratio for candidate character strings by selecting a conversion mode for a 
situation to determine a candidate character string. However, it is only 
possible to select from conversion modes that have been prepared beforehand, 
5 this method cannot adapt to unpredictable situation. Also, the selection of 
the conversion mode is left to the user, and there is no guarantee that the 
user will select the appropriate conversion mode. 

Moreover, it is conceivable to store many conversion modes in order 
to adapt to many situations, but this will consume a lot of valuable computer 
10 resources, so that this is physically impossible in small terminals. 

Moreover, in the method disclosed in Tokkai Hei 11-3331, the field of 
ih the currently edited text is specified, and a dictionary for the specified field is 

:t: selected, but here the fields themselves have to be specified beforehand, and 

• Ji in a situation such as entering a character string to produce a short message 

15 with, for example, a portable telephone, it is generally difficult to specify 
''^ such fields. Also, to specify such fields, field-specific dictionaries with 

sufficient capacity are necessary, which will consume a lot of valuable 
computer resources, so that this is physically impossible in small terminals. 

20 SUMMARY OF THE INVENTION 

It is an object of the present invention to solve the problems of the 
prior art, and to provide a device and a method for entering a character 
string, in which a dictionary is changed dynamically in accordance with the 
input situation of the user, and in which character strings that are generally 

25 suitable for the input situation of the user can be entered. 

In order to achieve these objects, a device for entering a character 
string of the present invention comprises an input part for entering a 
character string; an input situation acquiring part for acquiring a situation 
for entering a character string; a situation control part for affirming a 

30 dictionary used for generating a candidate character string or a part of such 
a dictionary in accordance with a situation acquired with the input situation 
acquiring part and designating it as situation— optimized dictionary; a 
candidate character string generation part for generating and outputting an 
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output candidate character string that is optimal for the situation in 
response to a character string that is entered with the input part, using the 
situation— optimized dictionary designated by the situation control part; a 
candidate character string affirmation processing part for affirming the 
5 outputted candidate character string; and an affirmed character string 
storing part for storing a character string that has been affirmed with the 
affirmation processing part in the situation-optimized dictionary designated 
by the situation control part. 

With this configuration, character strings that the user has affirmed 
10 as input character strings can be stored in a situation— optimized dictionary 
in adaptation of the situation when the input takes place, so that a 
rji situation— optimized dictionary in accordance with the input situation of the 
'2: user can be changed dynamically, and the input of optimum character 

ihj strings for an input situation becomes possible. 

v{u 15 Moreover, in the device for entering a character string of the present 

^ invention, it is preferable that situations acquired by the input situation 

acquiring part comprise at least one information selected firom the group 
^^r:; consisting of information relating to a character string processing device to 

pi which the output candidate character string is given; information relating to 

20 a text that the character string processing! device, to which the output 
candidate character string is given, can output; information relating to a 
position in a text that the character string processing device, to which the 
output candidate character string is given, can output; and information 
relating to a processing mode with which a character string that has been 
25 given to the character string processing device is processed. 

Information relating to a character string processing device to which 
the output candidate character string is given is useful because depending on 
the differences between character string processing devices, the character 
strings to be outputted may be limited to a certain extent, depending on 
30 whether a schedule is entered, or an electronic mail is entered, etc. 
Information relating to a text that the character string processing device, to 
which the output candidate character string is given, can output is useful 
because the character strings to be outputted also may be limited to a certain 
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extent, depending on the text that can be output. Also, information relating 
to a position in a text that the character string processing device, to which 
the output candidate character string is given, can output is useful because 
the character strings to be outputted also may be limited to a certain extent 
5 depending on the text that can be output, depending on the field to be output. 
And, information relating to a processing mode with which a character string 
that has been given to the character string processing device is processed is 
useful because the character strings to be outputted also may be limited to a 
certain extent depending on whether the processing mode is, for example, 
10 the search mode or the storage mode. 

Moreover, in the device for entering a character string of the present 
invention, it is preferable that the situation control part selects the 
; P situation— optimized dictionary from a plurality of dictionaries, in accordance 
Cji with the situation that has been acquired with the input situation acquiring 

I J; 15 part. This is useful when there is more than one situation-optimized 

Jj dictionaries, because then a situation-optimized dictionary can be selected 

that is optimal for a situation. 
rUi Moreover, in the device for entering a character string of the present 

J'.^l invention, it is preferable that the situation control part changes a method 

20 for generating the situation— optimized dictionary from a pluraHty of 
dictionaries, in accordance with the situation that has been acquired with 
the input situation acquiring part, because this way, an optimal candidate 
character string generation method can be selected. 

Moreover, in the device for entering a character string of the present 
25 invention, it is preferable that the affirmed character string storing part 
stores a storage date of an affirmed character string as a last— access date 
when storing the affirmed character string; the date when a character string 
that is already stored is accessed is used to change the last— access date; and 
after calculating, from the stored last— access date and the current date, a 
30 period of time that a character string has not been used, character strings 
which have not been used for a certain period of time are deleted from the 
situation— optimized dictionary. With this configuration, character strings 
that have not been used for a certain period of time can be deleted when they 
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are deemed to be of no reuse value for the user, so that the capacity of the 
situation— optimized dictionary does not grow excessively. 

Moreover, in the device for entering a character string of the present 
invention, it is preferable that the affirmed character string storing part 
5 divides an affirmed character string into units of a necessary lower limit, 
which can be optimized. Thus, by taking short character strings, the hit 
ratio can be increased, and the processor time necessary for the affirmation 
can be reduced. 

Moreover, it is preferable that the device for entering a character 
10 string of the present invention further comprises a situation—optimized 
dictionary production part for producing a situation— optimized dictionary by 
i J treating character strings that are used in a pre— existing electronic text in 

; the same manner as affirmed character strings. Thus, input strings can be 

Sii input with a certain degree of reliability even when the character input 

15 device is used for the fixst time. 
''4 In a computer— readable recording medium storing a program, to be 

|\ executed on a computer, the program comprising steps for entering a 

character string; acquiring a situation for entering a character string; 
i n: affirming a dictionary used for generating a candidate character string or a 

■^^ 20 part of such a dictionary in accordance with an acquired situation 

designating it as situation— optimized dictionary; generating and outputting 
an output candidate character string that is optimal for a situation in 
response to an entered character string, using the designated situation- 
optimized dictionary; affirming the outputted candidate character string; 
25 and storing an affirmed character string in a situation— optimized dictionary. 

With this configuration, character strings that the user has affirmed 
as input character strings can be stored in a situation— optimized dictionary 
in adaptation of the situation when the input takes place, by loading and 
executing the program on a computer. Thus, a situation— optimized 
30 dictionary in accordance with the input situation of the user can be changed, 
and input of optimum character strings for an input situation becomes 
possible. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a diagram of a first embodiment of a device for entering a 
character string in accordance with the present invention. 

Fig. 2 shows an example of the content in a situation— optimized 
5 dictionary in the first embodiment of a device for entering a character string 
in accordance with the present invention. 

Fig. 3 shows an example of the content in a situation— optimized 
dictionary in the first embodiment of a device for entering a character string 
in accordance with the present invention. 
10 Fig. 4 shows an example of the content in a situation— optimized 

dictionary in the first embodiment of a device for entering a character string 
^ J in accordance with the present invention. 

;H Fig, 5 shows an example of the content in a situation— optimized 

■ dictionary in an embodiment of a pattern recognition device in accordance 

15 with the present invention. 
S]| Fig. 6 is a flowchart for the first embodiment of a device for entering 

L a character string in accordance with the present invention, 

i W Fig. 7 is a diagram of a pattern recognition device in accordance with 

J pi the present invention. 

20 Fig. 8 shows an example of a handwritten character input pattern. 

Fig. 9 shows examples of recognized candidate characters. 
Fig. 10 is a flowchart of the process at the start of the recognition 
operation. 

Fig. 11 shows an example of the content in the situation— optimized 
25 dictionary of an embodiment of a pattern recognition device in accordance 
with the present invention. 

Fig. 12 shows an example of the, calculated estimate values for 
recognized candidate character strings. 

Fig. 13 shows an example of a handwritten character input pattern. 
30 Fig. 14 shows an example of the content in the situation— optimized 

dictionary of an embodiment of a pattern recognition device in accordance 
with the present invention. 

Fig. 15 shows examples of recognized candidate characters. 
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Fig. 16 shows an example of the calculated estimate values for 
recognized candidate character strings. 

Fig. 17 shows an example of the content in the situation-optimized 
dictionary of an embodiment of a pattern recognition device in accordance 
with the present invention. 

Fig. 18 shows an example of the calculated estimate values for 
recognized candidate character strings. 

Fig. 19 shows an example of a process for correcting a recognized 
candidate character string. 

Fig. 20 shows an example of a process for correcting a recognized 
candidate character string. 

Fig. 21 shows an example of a process for correcting a recognized 
candidate character string. 

Fig. 22 shows an example of a process for correcting a recognized 
candidate character string. 

Fig. 23 is a flowchart for an embodiment of pattern recognition device 
in accordance with the present invention. 

Fig. 24 shows examples of recording media. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
First Emb odiment 

The following is a description of a first embodiment of a device for 
entering a character string in accordance with the present invention, with 
reference to the accompanying drawings. Fig. 1 is a block diagram of the 
first embodiment of a device for entering a character string in accordance 
with the present invention, which can be applied, for example, to a portable 
telephone. 

In Fig, 1, numeral 11 denotes an input part pertaining to the numeric 
buttons of a portable telephone, numeral 12 denotes an input situation 
acquiring part, numeral 13 denotes a situation control part, numeral 14 
denotes a situation— optimized dictionary, numeral 15 denotes a candidate 
character string generation part, numeral 16 denotes a candidate character 
string affirmation part, numeral 17 denotes an affirmed character string 
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storing part, and numeral 2 denotes a character string processing device for 
outputting candidate character strings. 

In Fig. 1, a character string to be entered into the character string 
processing device 2 is entered with the input part 11. What is entered in 
5 this first embodiment is a signal string, which is generated by pushing down 
the numeric buttons, i.e. a numeric string. The entered numeric string is 
sent to the candidate character string generation part 15. 

Japanese uses a complex writing system of Chinese characters (also 
referred to as "Kanji" in the following) and two phonetic syllabic alphabets 
10 (also referred to as "Kana" in the following), each of the two phonetic syllabic 
alphabets comprising ten rows, as shown in Table 1: 



a— row 


ka- 
row 


sa— row 


ta— row 


na— 
row 


ha- 
row 


ma— 
row 


ya— row 


ra— row 


wa— row 


a 


ka 


sa 


ta 


na 


ha 


ma 


ya 


ra 


wa 


i 


ki 


shi 


chi 


ni 


hi 


mi 




ri 




u 


ku 


su 


tsu 


nu 


fu 


mu 


yu 


ru 


n 


e 


ke 


se 


te 


ne 


he 


me 




re 




o 


ko 


so 


to 


no 


ho 


mo 


yo 


ro 


wo 



Table 1 



Syllables including other consonants can be rendered with the use of 
15 diacritical marks: For example, the kana characters of the "ga"— row are 
identical to those of the "ka"— row, but are written with the addition of a 
diacritical mark. Thus, the kana character "gi", for example, is written as 
"ki" + diacritical mark. The first embodiment requires no distinction for 
characters with diacritical marks, and the process for entering characters 
20 with diacritical marks is the same as the process for entering characters 
without diacritical marks. Therefore, the process for entering the kana 
character "gi", for example, is identical to the process for entering the kana 
character "ki". 

In the first embodiment, the characters from the "a"— row to the "ra"— 
25 row are assigned to the numbers 1 through 9 of the portable telephone, 
respectively, and the "wa"-row and "n" are assigned to the number 0 (zero). 
Thus, any Japanese character string can be entered just by entering the 
numeric keys. 



When the character string processing device 2 has been started and 
characters are entered, the input situation acquiring part 12 acquires the 
input situation frora the character string processing device 2 into which the 
character string is given. Here, the "input situation" depends for example 
5 on what kind of device the character string processing device 2 is (for 
example, a schedule inputting device, an electronic mail inputting device, 
etc.), what kind of text is entered into the character string processing device 
2, what the current position in the text is (for example, the "mail address" 
field in a mail that is being edited), and what mode the character string 

10 processing device 2 is in (in a telephone number processing device, for 
example, search mode, storage mode, etc.). 

For example, if a schedule inputting device has been started as the 
character string processing device 2, the input situation acquiring part 12 is 
notified by the schedule inputting device that the schedule inputting device 

15 has been selected as the character string processing device 2. Moreover, 
when a change is made to the effect that an input can be made to a 
simultaneously running electronic mail inputting device, the input situation 
acquiring part 12 is notified by the electronic mail inputting device that the 
electronic mail inputting device has been selected as the character string 

20 processing device 2. 

The character string inputting device itself can also acquire 
information from the character string processing device 2, for example by 
acquiring information such as about a program running on the currently 
active window. 

25 Also, when the character string processing device 2 is in the search 

mode, the object being searched is also acquired as part of the input situation. 

The acquired input situation is sent to the situation control part 13. 
Based on the acquired information, the situation control part 13 assumes a 
candidate character string generation state according to the situation. 

30 Here, "candidate character string generation state" means that the 
situation— optimized dictionary 14 is changed in accordance with the 
situation and the method for generating candidate character strings is 
changed in accordance with the situation. 
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As a method for updating the situation-optimized dictionary 14 in 
accordance with the situation, it is possible to provide different situation- 
optimized dictionaries 14 for different situations, and to switch between 
these situation— optimized dictionaries 14. Or, instead of switching between 
dictionaries, it is also possible to employ a method for updating a priority 
order, and to provide a field specifying in which situation each item in the 
situation— optimized dictionary 14 has been used and to use only items of the 
field corresponding to the situation, or to change the priority order in 
accordance with a field that corresponds to the situation. 

This example relates to the character input in, for example, a 
portable telephone or a PHS. Conventionally, character input in portable 
terminals, like portable telephones and PHS devices is performed by using 
the dial buttons, wherein "1" corresponds to the "a"-row, "2" corresponds to 
the "ka"— row, etc. The vowel of each syllable is decided by the number of 
times that the same dial button is pressed. For example, in order to enter 
the Japanese word "o— ku— ri— ma— su", "1" is pressed 5 times, "2" is pressed 3 
times, "9" is pressed 2 times, "7" is pressed 1 time, and "3" is pressed 3 times 
(see Table 1). Thus, to enter five Japanese kana characters, it is necessary 
to press the buttons 14 times. 

To improve this situation, the simpUfied input method "T9" has been 
introduced recently. With "T9", the dial button "1" of the portable telephone 
or PHS device corresponds to the "a"-row, button "2" corresponds to the 
"ka"— row, etc., which is the same as above, but an internal language 
database is provided, and depending on the pressed buttons, variations of 
the stored words are scanned, and the intended word can be selected 
instantly. Thus, when entering the Japanese word "o— ku-ri-ma— su", it is 
sufficient to enter "12975". 

But however convenient "T9" may be, it is far fi:om perfect. That is 
to say, for short words, the number of selected variations grows massively, so 
that there is the inherent problem that the possibility increases that 
unintended words are selected. 

Now, the first embodiment Hmits the words that are selected by "T9" 
according to the input situation. For example. Figs. 2(a) and 3(a) show the 
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content of situation— optimized dictionaries 14 for two different situations. 
Fig. 2 illustrates the situation where the user inputs a schedule (called 
"situation 1" in the following), whereas Fig. 3 illustrates the situation where 
the user inputs a short message (called "situation 2" in the following). 
5 In the situation-optimized dictionary 14 for situation 1, character 

strings are stored that have been entered when using the character string 
processing device 2 to enter a schedule. Similarly, in the situation- 
optimized dictionary 14 for situation 2, character strings are stored that 
have been entered when using the character string processing device 2 to 

10 enter a short message. 

When the numeric buttons are pressed in the order "212" in situation 
1, a candidate character string is generated on the basis of the situation- 
optimized dictionary 14 for situation 1, so that the candidate character string 
"ka— i-gi" (which is the Japanese word for "conference") is output into the 

15 character string processing device 2 for entering a schedule. 

On the other hand, when the numeric buttons are pressed in the 
same order "212" in situation 2, a candidate character string is generated on 
the basis of the situation— optimized dictionary 14 for situation 2, so that the 
candidate character string "ke-i~ko" (which is a Japanese first name) is 

20 output into the character string processing device 2 for entering a short 
message. 

If the character string processing device 2 is in the search mode, a 
possible method for updating the method for generating a candidate 
character string in accordance with the situation is to take the search object 

25 acquired with the situation acquiring part 12 to change the character strings 
in the situation— optimized dictionary 14 so that they are generated as 
candidate character strings. Moreover, when the character string 
processing device 2 is in the input mode, assuming that a character string is 
entered that is not in the situation— optimized dictionary 14, it is possible to 

30 generate even character strings that are not in the situation— optimized 
dictionary 14 as a candidate character string. 

A regular standard dictionary is included during installation, since it 
is not possible to change the situation-optimized dictionary 14 dynamically, 



12 



if the situation— optimized dictionary 14 does not fulfill any dictionary 
functions at the begin of the use of the device for entering a character string 
of the first embodiment. Consequently, to generate character strings that 
are not in the situation— optimized dictionary 14, this standard dictionary is 
5 used. 

Fig. 4 shows an example of the content of the situation— optimized 
dictionary 14 when the user searches a name in the telephone directory 
stored in the portable telephone to make a phone call. In this case, the 
content of the situation-optimized dictionary 14 matches with what the 
10 situation acquiring part 12 has acquired as the stored content of the 
telephone directory. This means, if the numeric buttons are pressed in the 
order "212", the candidate character string "ko-i-ke" (which is a Japanese 
P family name) is given as the output in the mode of name searching the 

telephone directory 

]^ 15 In the candidate character string generation state, which is set by 

SJi the situation control part 13, the candidate character string generation part 

15 generates an optimal candidate character string from the entered 

!'y character string, in accordance with the situation, and sends it to the 

character string processing device 2, 

•Si 20 The optimal candidate character string sent to the character string 

processing device 2 is output by an output part in the character string 
processing device 2. Incidentally, an optimal candidate character string 
does not necessarily have to be determined unequivocally, and it is also 
possible that more than one optimal candidate character strings are 
25 generated after performing an optimum ranking, and that the user selects 
one candidate character string with the character string processing device 2. 

With respect to the candidate character string outputted by the 
output part of the character string processing device 2, the user performs an 
affirmation command with the candidate character string affirmation part 
30 16. "Affirmation command" means that the user gives a command which 
affirms an output character string, which can be done, for example, by 
selecting a certain candidate character string from a pluraUty of candidate 
character strings. It is also possible that the user does not undertake any 

13 



particular declaratory operation, but that the affirmation is taken to be 
made when the user goes on to the next input operation. 

When the user has given the command to affirm the candidate 
character string, the candidate character string affirmation part 16 sends 
the affirmed character string to the situation control part 13. The situation 
control part 13 sends the current input situation and the affirmed character 
string to the affirmed character string storing part 17. Then, the afl5.rmed 
character string storing part 17 stores the affirmed character string with the 
situation-optimized dictionary 14 in accordance with the current input 
situation. 

Fig. 5 shows an example of the situation-optimized dictionary 14. 
In Fig. 5, the stored items include the numerical string entered, the stored 
character string, the date that the character string has been used last, and 
the number of times that the character string has been used (also referred to 
as "recurrence") is stored. The date that the character string has been 
accessed last and the number of times that the character string has been 
accessed are renewed when the affirmed character string storing part 17 
stores the affirmed character string. It should be noted that in addition to 
the date, it is also possible to include the time that character string has been 
accessed last. 

Based on the date of the last access of the character string, it is 
possible to generate the candidate character string with priority given to 
recently accessed character strings, or with priority given to character 
strings with a access frequency. It goes without saying that it is also 
possible to combine these two factors, and to calculate an estimate value by 
weighting the priorities. 

Moreover, based on the date of the last access of the character string, 
character strings that have not been accessed for a certain period of time are 
deleted from the situation-optimized dictionary 14. For example, character 
strings that have been affirmed by error or have been accessed accidentally 
only once can be deleted after a certain period of time, so that data that is 
unnecessary to generate candidate character strings can be deleted from the 
situation-optimized dictionary 14, and the capacity of the dictionary can be 
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used economically. 

If the divisional units for the character strings are defined by 
separating words by "spaces" as in English for example, it is possible to store 
word units by separation with spaces. However, in languages in which the 
5 division into word units is difficult, such as in Japanese, it is preferable to 
perform the division into words after a morphological analysis or with a 
suitable method such as taking the point where the type of character (for 
example, kanji and kana characters) changes as the separation. 

As an example, let's examine the Japanese word 

10 "pataanninshikisouchi", which means "pattern recognition device" in EngUsh. 
The part "pataan" of this word is written in katakana characters, i.e. one of 
the two Japanese types of syllabic kana characters, whereas the part 
"ninshikisouchi" of the word is written in kanji characters. When the word 
"pataanninshikisouchi" is divided into character strings, it can be divided by 

15 morphological analysis into "pataanninshiki" (pattern recognition) and 
"souchi" (device). Alternatively, if it is divided where the type of character 
changes (here, from kana characters for "pataan" into kanji characters for 
"ninshikisouchi"), it can be divided into "pataan" and "ninshikisouchi". 

There is a certain physical Umit to the situation— optimized 

20 dictionary 14 in that it is limited by computer resources. That is to say, it is 
not possible to store more than a certain amount of data. Based on the date 
of the last access of the character strings, the character strings with the 
oldest date of last access can be deleted successively from the situation- 
optimized dictionary 14, so that it is possible to maintain the maximum 

25 storage number of character strings or the maximum storage capacity of the 
situation— optimized dictionary 14 constant. 

By adding the time of last access to the date, an even more precise 
management of the stored character strings becomes possible. It is also 
possible to delete character strings from the situation-optimized dictionary 

30 14 dynamically after a certain period of time has elapsed, or when a new 
word is input and stored in the situation-optimized dictionary 14. 

It is preferable that the user can set the maximum storage number or 
the maximum storage capacity freely, depending on the computer resources. 

15 



It is also possible to provide further a situation— optimized character 
string dictionary production part for producing a situation— optimized 
dictionary 14 by treating character strings that are used in a pre— existing 
electronic text in the same manner as affirmed character strings in the 
5 situation— optimized dictionary 14. In this case, whereas it is a fact that a 
user who uses the character string input device in a certain situation for the 
first time can hardly expect a candidate character string to be generated in 
accordance with this situation, it becomes possible to a certain degree to 
generate candidate character strings in accordance with the situation from 

10 the first time they are used. 

The following is an explanation of the process flow of a program for 
realizing the first embodiment of a device for entering a character string in 
accordance with the present invention, with reference to Fig. 6. 

In Fig. 6, when a character string is entered by the user (step S61), 

15 the situation when the character string is entered, that is, information on 
what kind of device the character string processing device 2 is (for example, a 
schedule inputting device, an electronic mail inputting device, etc.), what 
kind of text is entered into the character string processing device 2, what the 
current position in the text is (for example, the "mail address" field in a mail 

20 that is being sent out), and what mode the character string processing device 
2 is in (in a telephone number processing device, for example, search mode, 
storage mode, etc.) is acquired (step S62). Then, depending on the acquired 
situation, the situation-optimized dictionary 14 is changed and referenced 
dynamically (step S63). 

25 By selecting and referencing the situation-optimized dictionary 14 in 

accordance with the situation, an optimal candidate character string is 
generated in accordance with the situation (step S64) and is output as the 
candidate character string by the character string processing device 2 (step 
S65). 

30 Then, after performing a process in which the user affirms the input 

character string (step S66: yes), the affirmed character string is stored in the 
situation-optimized dictionary 14 (step S67), so that it can be applied for 
processing from the next time onward. 
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Thus, with this first embodiment, by dynamically renewing the 
contents of the situation— optimized dictionary 14 in accordance with the 
situation in which a character string is being entered, it is possible to achieve 
the optimum content for the input situation, and by performing the 
5 processing for the generation of the candidate character strings in 
accordance with the situation using the situation— optimized dictionary 14, it 
is possible to enter character strings with a high hit ratio. 

Second Embodiment 

10 The following is an explanation of a second embodiment of a device 

for entering character strings in accordance with the present invention, with 
reference to the accompanying drawings. Fig. 7 is a block diagram of the 
second embodiment of a device for entering a character string in accordance 
with the present invention, which can be applied, for example, to a portable 

15 computer. In this example, character strings are entered by pattern 
recognition of, for example, handwritten characters. In Fig. 7, numeral 70 
denotes a pattern input part comprising a handwritten input part accepting 
input with a pen on a part of a display screen, numeral 71 denotes a pattern 
recognition part, numeral 72 denotes a dictionary for pattern recognition, 

20 numeral 73 denotes a candidate character string generation part, numeral 
74 denotes a situation— optimized dictionary, numeral 75 denotes a situation 
control part assuming candidate character string generation status 
depending on the situation, numeral 76 denotes an input situation acquiring 
part, numeral 77 denotes a candidate character string affirmation part, 

25 numeral 78 denotes an affirmed character string storing part, and numeral 
79 denotes a display part including the previously described handwritten 
input part. Moreover, a dictionary managing part 80, which is a 
characterizing feature of the second embodiment, includes the candidate 
character string affirmation part 77 and the affirmed character string 

30 storing part 78, which is provided with a date renewal part 81 and a 
recurrence renewal part 82. 

In Fig. 7, a pattern to be recognized is entered with the pattern input 
part 70. In this first embodiment this is a handwritten character string. 
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The entered pattern is sent to the pattern recognition part 71. 

The second embodiment is explained by way of an example where the 
word "shachou" is entered as a handwritten pattern. The Japanese word 
"shachou", which means "company president" in English, consists of two 
5 kanji characters, as is shown in Fig. 8. In Fig. 8, the left kanji character is 
read "sha" and the right kanji character is read "chou". Of course, the 
present invention is not hmited to this method of input, and other methods of 
input, such as voice input, are also possible. Fig. 8 illustrates the case that 
the input is performed character by character with handwriting within 
10 specified frames, but there is no particular limitation to this, and the input 
can also be a handwritten character string in an input section without 
frames. 

The pattern recognition part 71 looks up the entered pattern, with 
reference to the dictionary for pattern recognition 72, and outputs recognized 

15 candidate characters in order of exactitude to the candidate character string 
generation part 73. To extract the recognized candidate characters, it is 
possible to use a commonly used OCR engine or onhne recognition engine, or 
any other kind of recognition engine. To calculate the exactitude, several 
methods are possible, for example, to calculate the distance between the 

20 entered pattern and the pattern of the dictionary for pattern recognition 72, 
or to make a calculation based on characteristic features. Fig. 9 shows an 
example of the recognition results for the case that the characters for 
"shachou" are entered by handwriting as shown in Fig. 8. In Fig. 9, the 
numbers listed next to the recognized candidate characters express the 

25 exactitude of the corresponding recognized candidate character, and the 
larger this number is, the higher is the exactitude. 

Based the result of the recognition obtained with the pattern 
recognition part 71, the candidate character string generation part 73 
determines a pattern— recognized candidate character string that is optimal 

30 for the entered pattern, using the situation— optimized dictionary 74. 

As is shown in the flowchart of Fig. 10, the situation-optimized 
dictionary 74 is read in from an external memory to an internal memory 
when the recognition operation begins. At first, the recognized candidate 
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character strings are compared with the character strings stored in the 
situation— optimized dictionary 74, which has been read into the internal 
memory. Based on the exactitude of the recognized candidate character 
strings with respect to the character strings in the situation— optimized 
dictionary 74, estimate values are calculated for the recognized candidate 
character strings. After comparison of the calculated estimate values, the 
recognized candidate character string with the highest estimate value is 
selected. Thus, an optimal pattern— recognized candidate character string 
can be determined when there is more than one candidate character string. 
Moreover, taking into account the date of the last access or the number of 
accesses, which are stored in the situation— optimized dictionary 74, the 
reliability of the estimate value with respect to the recognized candidate 
character strings can be enhanced, and a user— adapted recognized candidate 
character string can be output as the optimum pattern— recognized candidate 
character string. 

The following explains an example of the calculation of the estimate 
value. A recognized candidate character string W that is output from the 
pattern recognition part 72 can be expressed by a combination of n candidate 
characters Ci • • ... • (wherein n is a positive integer). Let Sj be the 
exactitude for a recognition candidate character Q. If the recognized 
candidate character string W is in the situation— optimized dictionary 74, 
then the estimate value Z can be expressed by 



Here, a is a factor that shows how the fact that the recognized 
candidate character string W is in the situation— optimized dictionary 74 is 
weighted, and which can be changed by the user. 

If the recognized candidate character string W is not in the 
situation-optimized dictionary 74, the estimate value Z can be expressed by 



n 



Eq. 1 
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Eq. 2 



By determining the estimate value Z with Eq. 1 or Eq. 2, a 
comprehensive estimate including the exactitude from the pattern 
5 recognition part 72 is possible, even if the combination of recognized 
candidate characters can be found incidentally in the situation-optimized 
dictionary 74. Thus, it is possible to output an adequate character string as 
the recognized candidate character string. 

If the recognized candidate character string W is in the situation— 
10 optimized dictionary 74, it is also possible to express the estimate value Z by 



(a ^ 0, a O) 

Here, freq(W) indicates the access frequency of the recognized 

15 candidate character string W stored in the situation-optimized dictionary 74, 
and passtime(W) indicates the time that has passed since the recognized 
candidate character string W has been accessed for the last time. Moreover, 
f(freq(W)) is a function of the access frequency, and g(p ass time (W)) is a 
function of the passed time. By weighting the functions f and g with the 

20 factors a and )3 , it is possible to adjust freely with what weighting the 
access frequency and the passed time are evaluated. 

For example, if, as shown in Fig. 11, the recognized candidate 
character strings "shachou" (which means company president in Japanese) 
and "rei— wo" (which means "thanks" in Japanese) are stored, and the 

25 estimate value is calculated with Eq. 1 for a = 30, then, as shown in Fig. 12, 
an estimate value Z = 1567 is obtained for the character string "shachou", by 
adding the sum of the exactitude 748 for "sha" with the exactitude 759 for 
"chou" to number of characters 2 x a = 60. Similarly, the estimate value 
for the character string "rei— wo" is calculated to Z = 1500. Consequently, 

30 the character string "shachou", which has the higher estimate value, is 
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recognized. 

It is also possible that handwritten characters are entered as English 
text, as shown in Fig. 13. Similar to the case of Japanese character strings, 
the following is an example of the calculation of the estimate value with Eq. 
5 3, when the two EngUsh recognized candidate character strings "hat" and 
"hut" are stored in the situation— optimized dictionary 74 as shown in Fig. 14. 

Based on the recognition result shown in Fig. 15, for the character 
string "hat", the sum of the exactitude 858 for the letter "h", the exactitude 
839 for the letter "a", and the exactitude 888 for the letter "t" is calculated to 
10 be 2585. 

Here, it is assumed that a = 5, )3 = 10, f(l) = 1, f(2) = 2. Moreover, 
the date of execution is January 6, 1999, and if the date of the last access is 
January 3, 1999, then the passed number of days takes on the value "3", 
whereas it would be "0" if the date of the last access is January 6, 1999. 
15 With these values as arguments, it is assumed that g(0) = 10 and g(3) — 7. 

If the estimate value is calculated for this case, since the number of 
characters is 3, Eq. 3 yields 

2585 + 3 X (5 X f(l) + 10 x g(3)) = 2585 + 3 x (5 x 1 + 10 x 7) = 2810. 

Similarly, if the estimate value is calculated for the character string 
20 "hut", the sum of the exactitude 858 for the letter "h", the exactitude 783 for 
the letter "u", and the exactitude 888 for the letter "t" is 2529, and adding 3 x 
(5 X f(l) + 10 X g(3)) = 3 X (5 X 1 + 10 X 7) = 225 yields 2754. 

Calculating the estimate value for all character strings as shown in 
Fig. 16, the character string "hat", which has the highest estimate value, will 
25 be outputted as the recognized character string. 

Next, the case is examined that the user performs a correction with 
regard to a recognized result. This might be the case, when the entered 
character string is not affirmed as "hat" but as "hut". 

In this case, the contents of the situation— optimized dictionary 74 
30 changes as shown in Fig. 17 at the time when "hut" is affirmed. This means 
that the date of the last access for the character string "hut" becomes 
January 6, 1999, and the number of accesses is increased to "2". 

If in this situation, the character pattern of Fig, 13 is entered again, 
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and the recognition result is as shown in Fig. 15, calculating the estimate 
values for each character string as above, gives the following result. 

Firstly, if the estimate value is calculated for the character string 
"hat" as above, the sum of the exactitude 858 for the letter "h", the exactitude 
5 839 for the letter "a", and the exactitude 888 for the letter "t" is 2585, and 
adding 3 x (5 x f(l) + 10 x g(3)) = 3 x (5 x 1 + 10 x 7) = 225 yields 2810. 

On the other hand, for the character string "hut", the sum of the 
exactitude 858 for the letter "h", the exactitude 783 for the letter "u", and the 
exactitude 888 for the letter "t" is 2529, and adding 3 x (5 x f(2) + 10 x g(0)) = 

10 3 X (5 X 2 + 10 X 10) = 330 yields 2859. 

Consequently, as above, if the estimate values for all character 
strings are calculated as shown in Fig. 18, it is determined that the character 
string "hut" has the highest estimate value, and therefore "hut" is outputted 
as the recognized character string. 

15 The reference of the situation— optimized dictionary 74 in the 

candidate character string generation part 73 is not limited to the case 
where the entire character string is matching, but it is also possible to 
extract partially matching character strings as recognized candidate 
character strings. This is because it cannot be guaranteed that the 

20 character strings are entered in the units of the character strings that are 
stored in the situation-optimized dictionary 74. 

Moreover, if the result of the reference to the situation— optimized 
dictionary 74 by the candidate character string generation part 73 is that 
there is no corresponding character string among the character strings 

25 stored in the situation— optimized dictionary 74, it is possible to perform a 
candidate character string determination process using a pre— existing 
lexicon or a conventional N— gram statistical dictionary, and to output the 
result as the optimal pattern recognition result. 

The recognized candidate character string that is decided by the 

30 candidate character string generation part 73 as the pattern recognition 
result is outputted to the display part 79. Then, if the user judges the 
recognized candidate character string displayed by the display part 79 to be 
faulty, the user can perform a correction process with the candidate 
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character string affirmation part 77. 

Here, "correction process" means that the recognized candidate 
character strings are displayed in the order of their calculated estimate 
values, and the user either selects the correct character string as the 
5 recognized character string or enters a new pattern to be recognized anew. 
For example, if, as shown in Fig. 19(a), the character string "reichou" (which 
is not a vaUd Japanese word, and which consists of the two Japanese kanji 
characters "rei" and "chou") has been recognized, it is possible, as shown in 
Fig. 19(b), to select the character "sha", which has a high exactitude. Also, 
10 as shown in Fig. 20(a), if unintentionally the character string "reichou" 
(which is not a valid Japanese word) has been recognized as in Fig. 19, it is 

fij 

; Qi possible to select, for example, the "reenter" button, whereby the recognized 

jt; candidate character "rei" is deleted, and correction is possible by entering a 

''4 new handwritten character string. It should be noted that there is no 

IS: 15 limitation to these processing methods, but any method can be employed 

"^^^ with which the correct pattern is recognized that user had intended to enter, 

u If the user judges that the outputted recognized character string is 

]^ correct, the user can perform the afifirmation process with the candidate 

'ill 

iyi character string affirmation part 77 in the dictionary managing part 80. 

20 Here, "affirmation process" means that the user gives an instruction to the 
effect that he affirms the recognized character string. Moreover, there is no 
particular limitation to methods where the user undertakes a declaratory 
affirmation instruction, and it is also possible that the affirmation is taken to 
be made when the user goes on to make the next input without undertaking 

25 any correction process, or that the outputted character string is taken to be 
affirmed when switching to another application. 

For example, if the handwritten character string "shachou" has been 
recognized correctly as "shachou", as shown in Fig. 21, the user can affirm 
the recognized character string declaratorily by pressing an "adopt" button 

30 or the like. If, as shown in Fig. 22, the entire sentence including the 
recognized result of the handwritten characters "shachou" is listed and 
displayed, the displayed character string "shachou" is regarded as an 
affirmed recognized character string, if another character on the display is 
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corrected or a new character is entered. 

Then, if the affirmation process for affirming a recognized character 
string is preformed with the candidate character string afELrmation part 77, 
the affirmed recognized character string is stored by the affirmed character 
5 string storing part 78 in the situation— optimized dictionary 74. Fig. 11 
shows an example of a situation-optimized dictionary 74. In the situation- 
optimized dictionary 74 in Fig. 11, character strings, the date when the 
character strings have been accessed for the last time, and the number of 
times that they have been accessed are stored. The date when the 

10 character strings have been accessed for the last time, and the number of 
times that they have been accessed are renewed through the date renewal 
part 81 and the recurrence renewal part 82, and stored in the situation- 
optimized dictionary 74 through the affirmed character string storing part 
78. In addition to the date, it is also possible to include the time that 

15 character string has been accessed last. 

For example, if the character string "shachou" is affirmed as the 
recognized character string by performing a correction process with respect 
to the recognized candidate character string shown in Fig. 21 with the 
candidate character string affirmation part 77, the affirmation character 

20 string storing part 78 stores the word "shachou" in the situation— optimized 
dictionary 74. If the character string "shachou" has previously not been 
stored in the situation— optimized dictionary 74, a new storage is performed, 
and if it has previously been stored, the date of the last access is changed to 
the current date, and the access recurrence is increased by "1". 

25 Moreover, based on the dates when the character strings have been 

accessed for the last time, it is also possible to perform a candidate character 
string determination process giving priority to character strings that have 
been accessed recently, or giving priority to character strings with a high 
access recurrence. It goes without saying that it is also possible to combine 

30 these two factors, and to calculate an estimate value by weighting the 
priorities. 

Moreover, when storing the recognized character string affirmed by 
the user in the situation— optimized dictionary 74, it is also possible to store, 
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of all affirmed recognized character strings, only recognized character 
strings including a part for which the user has performed a correction 
process. Since it can be expected that the next time that the same pattern 
as for a recognized character string that has been affirmed without 
5 performing a correction process is entered, the character string will be 
recognized correctly again, there is no necessity to store it in the situation- 
optimized dictionary 74. Consequently, by not storing such recognized 
character strings in the situation— optimized dictionary 74, the number of 
stored character strings in the situation— optimized dictionary 74 can be 

10 reduced, which makes it possible to decrease the size of the situation- 
optimized dictionary 74. 

Moreover, if a plurality of users share the same terminal, it is 
possible to provide user-specific situation-optimized dictionaries 74, in 
correspondence to, for example, a user ID that is entered when the user 

15 changes. This makes it possible to allow for a pattern recognition 
accommodating the usage of individual users. 

With the second embodiment as described above, in response to the 
access situation of the user, the situation— optimized dictionary can be 
adapted to a content that is optimal for the actual situation of the user, and 

20 even if the input is handwritten, it is possible to execute a pattern 
recognition with a high recognition rate, and to enter the character strings 
intended by the user. 

The following is an explanation of the process flow of a program for 
realizing the second embodiment of a device for entering a character string 

25 in accordance with the present invention, with reference to Fig. 23. 

In Fig. 23, if the user enters a pattern (step S231), for example 
handwritten characters or voice input, a pattern recognition is performed for 
the entered pattern (step S232), and the recognized candidate character 
string (or a combination of recognized candidate character strings) is looked 

30 up in the situation-optimized dictionary (step S233). 

If the recognized candidate character string is in the situation- 
optimized dictionary, an estimate value is calculated, which comprehensively 
evaluates the exactitude of the recognized candidate characters, the date of 
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the last access, and the access recurrence (step S234). Then, the most 
suitable recognized candidate character string is given out (step S235). If 
the recognized candidate character string is not in the situation— optimized 
dictionary, a combination of recognized candidate characters is pretended to 
5 be the recognized candidate character string. 

Then, after the user has performed, for example, a correction of the 
recognized candidate character string, the affirmation of the recognized 
candidate character string is processed (step S236). Then, in order to apply 
the results in future processes, the affirmed recognized character string is 

10 stored in the situation— optimized dictionary (step S237). 

Fig. 24 shows examples of recording media storing a program for 
realizing an embodiment of the device for entering a character string in 
accordance with this invention. As shown in Fig. 18, such recording media 
can be not only portable recording media 242 such as a CD— ROM 242—1 or a 

15 floppy disk 242-2, but also a memory device 241 connected by a 
communication circuit, or any other recording device 244 such as a 
computer's hard disk or a RAM. To execute the program, the program is 
loaded and executed on the main memory. 

Moreover, as shown in Fig. 24, the recording media storing the 

20 situation— optimized dictionary generated with an embodiment of the device 
for entering a character string in accordance with this invention can be not 
only portable recording media 242 such as a CD-ROM 242—1 or floppy disk 
242—2, but also a memory device connected by a communication circuit, or 
any other recording device 244 such as a computer's hard disk or a RAM, 

25 which can be read into a computer 243 when the device for entering a 
character string in accordance with this invention is used, for example. 

The invention may be embodied in other specific forms without 
departing from the spirit or essential characteristics thereof. The 
embodiments disclosed in this appUcation are to be considered in all respects 

30 as illustrative and not restrictive, the scope of the invention being indicated 
by the appended claims rather than by the foregoing description, all changes 
that come within the meaning and range of equivalency of the claims are 
intended to be embraced therein. 
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